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Abstract 

We established a new algorithm for correlation process in radio astronomy. This scheme consists of the 
lst-stage Fourier Transform as a filter and the 2nd-stage Fourier Transform for spectroscopy. The "FFX" 
correlator stands for Filter and FX architecture, since the lst-stage Fourier Transform is performed as a 
digital filter, and the 2nd-stage Fourier Transform is performed as a conventional FX scheme. We developed 
the FFX correlator hardware not only for the verification of the FFX scheme algorithm but also for the 
application to the Atacama Submillimeter Telescope Experiment (ASTE) telescope toward high-dispersion 
and wideband radio observation at submillimeter wavelengths. In this paper, we present the principle of 
the FFX correlator and its properties, as well as the evaluation results with the production version. 

Key words: techniques: spectroscopic — instrumentation: spectrographs — radio continuum: 
general — radio lines: general- instrumentation: interferometers 



1. Introduction 

The signals received by the antennas obey the station- 
ary stochastic process and then ergodic process. The er- 
godic theory can be applied to the auto-correlation func- 
tion for a spectrometer and the cross-correlation func- 
tion for radio interferometer. Under such conditions, 
Wcinreb (1963) developed the first digital spectrome- 
ter. This digital spectrometer is called the XF correla- 
tor in which the correlation is calculated before Fourier 
Transform. Meanwhile, Chikada et al. (1987) developed 
the first the FX correlator of an another design, in which 
Fourier Transform is performed before cross multiplica- 
tion. Although there is a difference of property between 
two basic designs, the obtained astronomical spectra of 
them were confirmed to be identical. 

Determining the number of correlation lags in the XF 
scheme or of Fourier Transform points in the FX scheme 
is essential for the realization of high-dispersion and wide- 
band observation, because the frequency resolution is de- 
rived as 



A/ = 1/(A£ S A0 = 2B/N, 



(1) 



where At s is the sampling period, N is the number of cor- 
relation lags or Fourier Transform points, and the band- 
width of B is equal to l/(2At s ). The material size and 
cost of the correlator strongly depend on the sampling 
period, At s , and the number of correlation lags or Fourier 
Transform points, N. 

The new XF architecture with the digital Tunable 
Filter Bank that is designed with the Finite Impulse 
Response (FIR) has been proposed and developed for 
the next generation radio interferometers, the Expanded 
Very Large Array (EVLA) and the Atacama Large 
Millimeter/submillimeter Array (ALMA) (Carlson 2001, 



Escoffier et al. 2007). This is called the "FXF corre- 
lator". The architecture of the FXF scheme can make 
the material size smaller in comparison with that of the 
conventional XF scheme. Since the digital filter allows a 
variety of observation modes [scientific and observational 
availability were shown in Iguchi et al. (2004)], the FXF 
scheme will provide us with the most appropriate specifi- 
cations which meet the scientific requirements. This will 
lower the risk of over-engineering of the correlator. 

The improved FX architecture with DFT filtcrbank was 
developed by Bunton (2000). The use of polyphase fil- 
ter banks allows arbitrary filter responses to be imple- 
mented in the FX scheme (Bunton 2003). This is called 
the "Polyphase FX Correlator". This scheme has a pos- 
sibility to achieve the spectral leakage of about -120 dB. 
In particular, this performance is significant to suppress 
the leakage from the spurious lines mixed in receiving, 
down-converting or digitizing. 

The FFX Correlator is a new algorithm for correlation 
process in radio astronomy. The FFX scheme consists of 
2-stage Fourier Transform blocks, which perform the lst- 
stage Fourier Transform as a digital filter, and the 2nd- 
stage Fourier Transform to achieve higher dispersion. The 
first 'F' of the FFX is the initial letter of the word "Filter" . 
In this paper, we present a new FFX architecture. The 
principle of the FFX scheme in section 2, the properties 
of the FFX scheme in section 3, the algorithm verification 
and performance evaluation with the developed FFX cor- 
relator in sections 4 and 5, and the summary of this paper 
in section 6 are presented. 

2. Principle of the FFX Correlator 

This section shows the algorithm and the data flow dia- 
gram of the signal processing in the Fourier Transform of 
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the FFX scheme (see figure 1). Suppose that the 
digital waveforms at the correlator input from the astro- 
nomical radio signals that are received by the telescope. 
The inputs, x n , are real digital signals at sampling period 
of At s , and obey the zero- mean Gaussian random variable. 
The suffix n is an integer for time. 

[Step 1] The correlator receives the time-domain digi- 
tal sampling signals from the Analog-to-Digital Converter 
(ADC), and accumulate them up to N n points. 

[Step 2] The time-domain Appoint data are transferred 
to the frequency-domain by using the A^-point Discrete 
Complex Fourier Transform as follows: 



Table 1. Definition of functions. 



X p = At, 



E( 2irpn \ 
x n exp [-J—jy— J 



(2) 



where X is the spectrum after the 1st Fourier Transform, 
the suffix p is an integer for frequency, and At s is equal to 
l/(2Si s t) at the bandwidth of Bi st . The A/i st is the min- 
imum frequency resolution of the 1st Fourier Transform, 
which is equal to l/(At s N n ). 

[Step 3] The extraction of the points from the fre- 
quency domain AT n /2-point data after the 1st Fourier 
Transform is conducted as if filter and frequency conver- 
sion are performed simultaneously: 



XL 



(p = k + k , k = 0, 



,N k 



1) 



(3) 



where ko is the minimum frequency channel in the extrac- 
tion, and the suffix k is an integer for frequency. 

[Step 4] The A^-point data after Inverse Fourier 
Transform is written by 



AtfN k 



Afk-l 

■E 

k=0 



A^exp 



2ir(k-N k /2)l 



A/k 



(4) 



where x' is the time-domain signal after inverse Fourier 
Transform, the suffix I is an integer for time, and Ati is 
the sampling period after filtering at the bandwidth of 
Sand (Atf = 1/S 2nd = 1/A/istA/k). 

[Step 5] By repeating the procedure from Step 1 to Step 
4, the data are gathered up to N m points as follows; 



x m ~ x l+dN k 



(5) 



where m is l + dN k , and d is the number of repeating times 
of the procedure from Step 1 to Step 4. 

[Step 6] The time-domain A m -point data after gathering 
are transferred to the frequency-domain by using the Ap- 
point Discrete Complex Fourier Transform as follows: 



At f £ 



x m exp 



2wqm \ 



(6) 



where X' is the spectrum after the 2nd Fourier Transform, 
and the suffix q is an integer for frequency. The A/2 n d is 
the minimum frequency resolution after the 2nd Fourier 
Transform, which is equal to l/(Atf N m ) (=Afi st N k /N m ). 

The definition of all functions used in this section is 
summarized in table 1. Also, the summary of the relation- 
ship among the functions (see table 1) is listed in table 2. 
The following relations are derived: 



Mark Explanation 



Si s t Bandwidth of the input signals 

At s Sampling period of the input signals 

N n Number of the points of 1st FT 

A/i s t Minimum frequency resolution of 1st FT 

N k Number of the extraction times as a filter 

S 2n d Bandwidth after extraction 

Atf Sampling period after filtering 

N m Number of the points of 2nd FT 

A /2nd Minimum frequency resolution of 2nd FT 



Note that FT is Fourier Transform. 



Table 2. Relationship among the functions (see table 1). 



Equation 


(a) 


At s 


= l/(2S l8t ) 


(b) 


A/i s t 


= 2B lst /N n 






= 1/ (At s N n ) 


(c) 


At f 


= 1/S 2n d 






= l/(A/i Bt JV k ) 






= At s N n /N k 


(d) 


A/ 2nd 








= l/(At f AT m ) 






= Af lst N k /N m 






= 2B lst /N n -N k /N m 



A/ lst = 2B lst /N n , 
Af 2nd = 2B lst /N n -N k /N n 



(7) 

(8) 



The frequency resolution of the FFX scheme is determined 
by the number of of the 1st Fourier Transform points(A r n ), 
the number of the extractions as a filter (N k ), and the 
number of the 2nd Fourier Transform points(A r m ). 

3. Properties of the FFX Correlator 

The frequency responses for spectroscopic observations 
are finally derived by Fourier Transform in all schemes. 
For finite length of Fourier Transform, the responses are 
multiplied by a rectangular window function, which cor- 
responds to convolving sine function in the frequency do- 
main. The frequency profile of the XF scheme becomes the 
shape of sine function profile, while that of the FX scheme 
is sine squared function profile. This indicates the FX 
scheme (including the FFX and polyphase FX schemes) 
is better than the XF scheme (including the FXF scheme) 
from the view points of the frequency profile and sharp- 
ness of individual frequency channels, and the spectral 
leakage. 

For the realization of the high frequency resolution in 
the conventional FX scheme, the A^-point complex FFT 
may be divided into A^/2-point FFTs, Appoint twiddle fac- 
tor multiplications, and A r /2-point second FFTs, because 
the circuit size of the LSI (Large-Scale Integration) is lim- 
ited (Iguchi et al. 2002). The memory and circuit for 
the twiddle factor multiplications are critical. However, 
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[Step 6] Frequency domain Nm-point data after 2nd Fourier Transform 
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Fig. 1. Algorithm and the data flow diagram of the FFX scheme. 
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the high frequency resolution can be realized in the FFX 
scheme without the twiddle factor multiplications. 

The FFX scheme has advantages of selectivity for fre- 
quency resolution and bandwidth in comparison with 
other schemes. The comparable functions are also real- 
ized by implementing a digital LO circuits (Escoffier et 
al. 2007). The digital LO circuits need to be delicately 
designed to avoid the spurious in mixing the digital LO 
signals due to the rounding errors in the calculation pro- 
cess. 

For the investigation of aliasing or foldover from fre- 
quencies over the bandedge in the FFX scheme, it is nec- 
essary and important to estimate the frequency response. 
As shown in [Step 3] in figure 1, the desired frequency 
response, equation (3) can be rewritten as 



X k = X' (p = k + k , fc = 0,---,7V k -l), 



Hp ■ X p , 



1 (p = fc ,"-,fco + iVk-l) 
( P <k , p>k +N k -l). 



(9) 
(10) 

(11) 



However, "-ffk " should be replaced by zero to reduce 
the aliasing or foldover of noise from frequencies over the 
bandedge. Thus, in the FFX scheme, equation (11) need 
to be replaced as 



1 (p = k 
(p < k 



l,---,ko+N k -l) 
1, p>k + N k -l) 



(12) 



According to the two (1st and inverse) Fourier Transforms 
at the same resolution, the actual designed transfer func- 
tion of the impulse response, which is derived with the 
square of sine function, is represented as follows: 




sin[7r(iV n Af B /-p)] 
n(N n At s f-p) 



am[n(N D At B f-p)] y 
n(N a AUf-p) J 



(13) 



(14) 



where / is an arbitrary frequency, and also the response 
of a sine function is caused by one Fourier Transform. 
It can be confirmed that the designed transfer function 
approaches the desired frequency response by increasing 
N n while keeping N n /N k constant. The filtering process 
in the FFX scheme is similar to that used in the design 
method of a Frequency Sampling Filter, FSF (Rabiner 
and Schafcr 1971). 

Also, to improve the frequency response, the window 
function before 1st Fourier Transform can be multiplied. 
In that case, equation (14) should be written as 



Wf)\ = 



N n -1 

£ 

p=0 

P(f) 



H^W(N n AtJ-p) 



sm[n(N n At s f-p)] 
Tr(N n At s f-p) ' 



'JV n -l 

J2 H 2 v W{N n At s f-p) 



sin[n(N a At B f-p)] 
n(N n AtJ-p) 



(15) 



, (16) 



where W(/) is the response after the window function 
w(n) is transferred to the frequency-domain by Fourier 
Transform. If the window is a rectangular window func- 
tion, W(f) becomes a sine function. In that case, equa- 
tion (16) consists with equation (14). There are the fol- 
lowing famous window functions: Hanning, Hamming, 
Blackman, and Kaiser. By well making a choice of the 
window function, the first sidelobe of stopband in the fre- 
quency response will be improved. It is well known that 
the first sidelobe levels with Backman and Kaiser as the 
window are better than those Hanning and Hamming. For 
the FFX scheme, it is found that the Bessel function of 
zeroth order Jo is better than others. This frequency re- 
sponse is shown in figure 2. The first and second sidelobe 
levels is about -34 dB, the fifth and sixth sidelobe levels is 
about -50 dB, and higher-order sidelobe levels will be bet- 
ter than -60 dB (see figure 2a). The stopband response of 
the FFX scheme is not better than that of the polyphase 
FX scheme (Bunton 2000) and that of the FXF scheme 
for EVLA (Carlson 2001). On the other hand, the ripple 
response in the passband is less than 0.4 dB peak-to-peak 
(see figure 2b). This performance is better than that of 
the polyphase FX scheme. 

Note that the stopband performance to suppress the 
spurious lines can be improved by installing "the detec- 
tion and cancellation techniques of high and low frequency 
spurious lines" into the FFX scheme (Chikada found this 
algorithm in the development of ALMA/ AC A correlator). 



Requirements and Specifications for 
Development of the FFX Correlator 



the 



The development of the FFX correlator hardware is sig- 
nificant for the verification of the FFX scheme algorithm. 
It is necessary to define the requirements and specifica- 
tions of the hardware of the FFX correlator. The hard- 
ware size can be optimized for the verification of algo- 
rithm. On the other hand, there were the scientific re- 
quests for the application to the Atacama Submillimctcr 
Telescope Experiment (ASTE), which is a new project 
to install and operate a 10-m submillimctcr telescope at 
a high latitude site (4,800 m) in the Atacama desert in 
northern Chile (Ezawa et al. 2004). Under these sit- 
uations, the FFX correlator hardware for the algorithm 
verification was specified by also considering the scientific 
requirements for submillimeter astronomy including the 
application to ASTE. 

In case of spectroscopic observations of atomic / molec- 
ular line emissions, their line width are extended by the 
Dopplcr shift along a line of sight with movement of inter 
stellar matter (ISM). In nuclear regions of external galax- 
ies and Ultra Luminous Infrared Galaxies (ULIRGs), the 
molecular clouds which have various velocity components 
can be observed simultaneously, and line widths of the 
observed atomic / molecular emission lines are extended. 
For example, the line width of CO line emission of external 
galaxy is sometimes extended to more than 800 km s _1 
(i.e. Narayanan et al. 2005), which corresponds to about 
920 MHz in 12 CO(J=3-2) (u Iest ~ 345.796 GHz) and about 
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Fig. 2. The frequency response of 1st FFT stage in the FFX scheme, which works as a digital filter, (a) is the frequency response 
at the full range to investigate the stopband, and (b) is the closeup frequency response in the passband to investigate a ripple. The 
frequency response is multiplied by the window of the Bessel function, w(n) = Jq ( 4 ^ 4n j . 



Table 3. FFT segment length. 



FFT stage 


FFT segment length 


First stage FFT 


1024 points 


Inverse FFT 


8 points 


Second stage FFT 


4096 points 



2.2 GHz in 12 CO(J=7-6) (i/ re8t - 806.652 GHz), by rotat- 
ing around its nuclear region. On the other hand, in order 
to evaluate the kinematics of protoplanctary disks and the 
internal structure of molecular clouds in the Milky Way, it 
is also necessary to resolve their thermal line widths using 
spectrometer with high frequency dispersion; for instance, 
frequency resolution of 32 kHz corresponds to velocity res- 
olution of 0.032 km s _1 at 1 mm wave length. 

As the full bandwidth of more than 3 GHz is required, 
the FFX correlator need to achieve the processing speed 



of 8192 Mega sample per second (Msps). In that case, for 
the realization of two-type spectral resolutions of about 5 
MHz and less than 32 kHz, the FFX correlator must meet 
at least the specifications in table 3. 

5. Development and Evaluation of the FFX 
Correlator 

The correlation processing block diagram of the FFX 
correlator is shown in figure 3. The FFX correlator 
consists of DTS-R (Data Transmission System Receiver) 
module, Correlation Module, and the Monitor & Control 
Computer. In the DTS-R module, the DTS-R Board or 
the ADC (Analog-to-Digital Converter) Board is imple- 
mented as an EIB (Electrical input Interface Board) . The 
input data rate of the FFX correlator is about 48 Giga bit 
per second (Gbps) with 3-bit quantization at the sampling 
frequency of 8192 or 4096 Msps, which is 8192 Msps x 3 
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Fig. 3. Correlation processing block diagram of the FFX scheme. 



bits x 2 IF or 4096 Msps x 3 bits x 4 IFs. The DCDCB 
(Delay Correction and Data Configuration Board) effec- 
tively distributes the input signals to the next boards for 
the parallel correlation processing. For the data process- 
ing at the throughput of 8192 Msps, the correlation is per- 
formed with 16 parallels, and both of the real and imag- 
inary parts are used in FFT. The data is sent to each 
16-parallel CORB (Correlation Board) per one-segment 
length. In the correlation mode of the FFX scheme, the 
data is sent per total segment length which is determined 
considering the signal process including the second stage 
of FFT. Final correlation output is obtained by adding 
16-parallel correlation results. The correlation output is 
sent to the Monitor & Control Computer via LAN cable. 
Switching between the FX processing and the FFX pro- 
cessing is normally operated by setting the command into 
the Monitor & Control Computer. 

The correlation processing flow of the FFX correlator 
is shown in figure 4. All main logics are implemented in 
FPGAs (Field Programmable Gate Array). 

5.1. Delay Correction and Data Configuration Board 
(DCDCB) 

5.1.1. Delay Correction 

The FFX correlator has a delay correction circuit per bit 
for every 3-bit sampling signal. Delay tracking for every 
single bit is realized by extracting 64-sample length from 
time-sequential 128 samples that are produced by splitting 
the output from FIFO (First In, First Out) memory in 
two, and shifting one side or the other in one-clock phase 
(see figure 5). Delay correction circuit has a FIFO memory 
of 1.024 Mega samples to each bit for delay tracking. In 
case of 8192 Msps, the delay correction range is ± 512 kilo 
samples, that is ± 62.5 /xsec (=± 18.75 km). 

5.1.2. Data Distribution 

Given the operation speed of the device (FPGA etc.), 
16-parallel correlation processing is essential to achieve 
the throughput of 8192 Msps. To do this, the input sig- 
nals are divided by one segment length of FFT and sent 
to the Correlation Board (CORB) so that each parallel 
processing is performed separately. 
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Fig. 6. Data Configuration. 

In the FFX processing, the data is divided by single 
segment length of the second-stage FFT. From table 3, 
the input signals are divided into: 



1024 x 4096/8= 512 k, 



(17) 



where k is 1024 (2 10 ). This value determines the one seg- 
ment length of this FFT processing. 64-parallcl signals 
are converted to 4-parallel after being output to each dis- 
tribution buffer (sec figure 6). 

5.2. Correlation Board (CORB) 

5.2.1. Operation Format 

The operations of the correlation processing are per- 
formed using 16-bit floating point format. FFT calcula- 
tion is expressed as 

-I s -2 e - 16 -(l + ma/1024), (18) 

and the range is from ± 1/32768 to 131008. When the in- 
dex (e) is 0, it is regarded as instead of 2 -16 irrespective 
of mantissa. 

This not only simplifies the processing but also improves 
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Fig. 7. Conversion between 16-bit floating point and IEEE 
single precision floating point. 
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Fig. 8. Parallel processing of two datasets in FFT. 

the compatibility with IEEE single precision floating point 
data. IEEE single precision floating point has 32-bit rep- 
resenting sign (S): 1 bit, exponential (E): 8 bits, and 
mantissa (Mo): 23 bits, and is expressed as 



-l s -2 B,127 -(l + M/223), 
and they have following relations 

S = s, 

E = e+lll, 
Ma = max 8192(2 13 ). 



(19) 

(20) 
(21) 
(22) 



Conversion between 16-bit floating point and IEEE single 
precision floating point is also shown in figure 7. 

5.2.2. Parallel processing of two datasets 

Since FFT is the linear response, the real and imaginary 
parts of the input can be used separately. Since the signals 
received by an antenna are real part only, two datasets 
are combined into one complex datasct for the FFT pro- 
cessing (see figure 8). One dataset is inserted into a real 
part, and then next dataset is inserted into an imaginary 
part. These two datasets are shifted with onc-FFT seg- 
ment length (M), and then processed as one complex data. 
Combination of two datasets is performed by Read/ Write 
sequence control of the received buffer (=input buffer in 
FPGA). This method can reduce the material size of the 
correlator. 

5.2.3. F- Block (FX mode) 

The FX processing is performed with 16fc-point FFT. 
In terms of the throughrate, pipeline processing is applied 
to each FFT stage processing. To reduce the memory size 
in the pipeline processing, 16fc-point FFT is realized by 
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Fig. 10. Pipeline processing of the FFX scheme (lfc-point 
FFT, 8-point IFFT, 4fc-point FFT). 

being divided into three parts: 32-point FFT, 32-point 
FFT, and 16-point FFT (see figure 9). The input signals 
are multiplied by the window function between the input 
buffer and FFT, so that the input buffer size is based on 
the 3-bit signals. FFT is realized by changing the twid- 
dle factor according to the processing stages. This helps 
minimize the ROM of the twiddle factors. 

5.2.4. F-Block (FFX mode) 

The FFX mode is shown in figure 10. The first-stage 
FFT (lfc-point FFT) is performed in the same manner as 
32 points x 32 points of FX mode. 

The second-stage FFT (4fc-point FFT) is serially pro- 
cessed with one butterfly computing unit. Since 2x8- 
point data is obtained every single process of lk-point 
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Fig. 11. Sampling bit level. 
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Fig. 12. Sampling bit level at 3-bit length. 

first-stage FFT, 4096-point data is obtained by repeating 
the process of the first-stage FFT 256 times. The required 
time for this process is: 512 x 256 = 128fc [CLK]. On the 
other hand, the required time for the process of 4096-point 
FFT using one butterfly computing unit is: 



4096 x log 2 (4096) /2 = 24fc [CLK] . 



(23) 



Compared with these processing times, the processing 
time of 4096-point FFT with one butterfly computing unit 
is much shorter. 

5.2.5. Window function processing 

After re-allocated to two datasets, the signals are mul- 
tiplied by window function. Any of the following win- 
dow functions are selected: None (rectangular window), 
Harming, Hamming, and Blackmail. Window functions 
are generated in the CPU according to the selected 
command ("WINDOW") from the Monitor & Control 
Computer before the correlation process starts. 

5.2.6. Data conversion 

The input signals go through 3-bit processing. By 
setting a command (" SMPLBIT") from the Monitor & 
Control Computer, the input signals are processed as 1- 
bit or 2-bit data. Also, conversion to 1-bit data with (M) 
middle bit only is available. Assumed threshold value for 
each bit of sampling data is given as shown in figure 11. 
Based on the above threshold value, converted value of the 
input data is calculated as shown in figure 12. The input 
signals are converted into 16-bit floating point operation 
format before FFT. 
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Fig. 13. Block diagram of the correlation in a FX mode. 
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Fig. 14. Block diagram of the correlation in a FFX mode. 

5.2.7. X-block (FX mode) 

The composition of X-block in normal FX processing is 
shown in figure 13. Correlation processing is performed 
in 8fc-channel either of USB or LSB. For improving the 
sensitivity loss (see table 6 of Okumura et al. 2001), 2- 
channel frequency binning is performed to generate Ak- 
channel data (Frq. bin). Integration of 100 milliseconds is 
conducted in time integration (S.T.Ing). If the integration 
takes longer than 100 milliseconds, it is performed in the 
Long-Term Accumulation and Output Board (LTAOB) at 
the time of 16-parallel data synthesis. 

In OTF mode, the data is compressed from 8k fre- 
quency channels to 32 frequency channels by frequency 
binning. With time component of 1 millisecond, the corre- 
lation buffer area is changed every 1 millisecond. The out- 
put to the Long-Term Accumulation and Output Board 
(LTAOB) is performed every 100 millisecond. 

Prior to the correlation processing, the data from the 
F-block needs some pre-processing such as split of two 
datasets, USB/LSB (upper and lower sideband) selection, 
and 90-degree and 180-degree phase switching demodula- 
tions. In splitting process of two datasets, the input data 
are added and subtracted, and then the two-data compo- 
nents combined in the F-block are split. When LSB is 
selected, the sign of the split Imag component is inversed. 
On the other hand, when USB is selected, the split data 
is sent to the correlation process without any additional 
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Fig. 16. AW correction circuit. 

processing. In 90-degree and 180-degree phase-switching 
demodulations, switching between Real and Imag, and 
sign inversion are performed according to the two phase- 
switching signals. 

5.2.8. X-block (FFX mode) 

X-block in the FFX processing is shown in figure 14. In 
the FFX processing, X-block has a dual structure consid- 
ering the processing speed of the device. The first-stage 
FFT is the same as that of FX mode, however frequency 
binning is not performed. Split of two datasets in the 
second-stage FFT is not performed in X-block, since the 
process is performed in F-block. The results of the second 
stage FFT are output together with USB and LSB. Two- 
channel frequency binning is performed for improving the 
sensitivity loss (see table 6 of Okumura et al. 2001). 

5.2.9. AW correction 

AW correction is performed based on the baseline. AW 
correction coefficient is generated within FPGA. Firstly, 
A W (t) , the gradient of AW to time change, is evaluated 
and then the value of each frequency channel is calculated 
(see figure 15). Circuit diagram of the line graphs above 
is shown in figure 16. 

For every 1 millisecond, the initial value (Init) is pro- 
vided by the Monitor & Control Computer, and read as 
a initial gradient [AW / o(0)]. In the border between seg- 
ments, the previous value is multiplied by the gradient 
(Grad) data to generate a gradient of a new segment 
[AWo(i)]- In the arbitrary time (t), initial value (the value 
of DC) is set to 0. AW is set for every 128 channel in the 
full bandwidth of 8k channel, which means the full band- 
width (8k channel) is corrected with 64 steps. The initial 
value of AWo(i) can be set for every 1 millisecond, how- 
ever, a given value is set within 128 channel. The gradient 
of AWo(i) is the gradient variation of AW per one seg- 
ment length for FFT. The Monitor & Control Computer 
specifies a set of the initial value and gradient approxi- 
mately every 1 second. 
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Fig. 17. Long-Term Accumulation/Output Board. 

5.3. Long-Term Accumulation/ Output Board (LTAOB) 

Final correlation value is calculated by adding the out- 
put of 16 correlators. The composition of the Long-Term 
Accumulation/Output Board (LTAOB) is shown in fig- 
ure 17. When the correlation result is output, the data is 
converted into the format of IEEE single-precision floating 
point. 

In the FX processing, the number of the output fre- 
quency channels is normally 4 k (=4096). Thus the output 



data size per one correlation is 

4fc x 32 x 2 x 3 = 768fc bits, (24) 

where 32 is the number of single-precision floating bits, 
2 is the complex, and 3 is the number of correlations; 
auto-correlations of x and y, and a cross-correlation be- 
tween them. Assuming the minimum integration time is 
0.1 second, the estimated output speed is approximately 
7.7 Mbps. In the FFX processing, the number of the out- 
put frequency channels is 512 + 2 k, thus the data size 
per one correlation is 480 k bits. Consequently, the es- 
timated output speed with 0.1-second integration time is 
approximately 4.8 Mbps. Correlation results are sent to 
the Monitor & Control Computer using TCP /IP protocol 
of 100 BaseT-Ether. 

5.4- Operation mode 

The FFX correlator are divided into four F parts in 
principle (not physically). Each F part in a FX mode 
and the first FFT stage of FFX mode are operated at 
2048 MHz, and each F part at the second FFT stage of 
FFX mode are operated at 32 MHz. Also, the digital 
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Fig. 18. FX correlator under development. There are two modules: one is the DTS-R module (top: left is a drawing of the front 
view, and right is a photograph), having a size of (W) 490 mm X (H) 500mm X (D) 500 mm; the other one is the Correlation 
module (bottom: left is a drawing of the front view, and right is a photograph), having a size of (W) 490 mm X (H) 500mm X (D) 
500 mm. The total size of the GDFB is (W) 490 mm X (H) 1100mm X (D) 500 mm, including the two power supply modules. 



signals input from EIB are distributed to arbitrary F parts 
by using the command ("FCHSEL") from the Monitor 
& Control Computer. In FX mode and the first FFT 
stage of FFX mode, the digital signals of 8192 Msps are 
operated by combining two F parts (= 2x2048 MHz), 
while the signals of 4096 Msps arc operated by using one 
parts (=1x2048 MHz). At the second FFT stage of FFX 
mode, the digital signals of 8192 Msps are operated by 
combining two F parts (=2x32 MHz), while the signals 
of 4096 Msps are operated using only one parts (=1x32 
MHz). By using the command ("FCHSEL"), the effective 
bandwidth after the second FFT stage can be changed. 

All of the operation modes available in this FFX corre- 
lator are listed in table 4 and 5. 

5. 5. Hardware of the FX Correlator 

The Hardware of the FX Correlator is shown in fig- 
ure 18. The DTS-R module consists of two Electrical in- 
put Interface Boards (EIBs), two Delay Correction and 



Data Configuration Boards (DCDCBs), and one DTS-R 
Monitor & Control Boards (DRMCBs). The Correlation 
module consists of eight Correlation Boards (CORBs), 
Long-Term Accumulation/Output Board (LTAOB), and 
one Correlation Monitor & Control Board(CORMCB). 
Each module is connected to the independent Power mod- 
ule. 

The power consumption of the filter module is 400 W, 
while that of the output module is 600 W. The total AC 
power is 750 W at 1-phase 100-220 VAC 50/60 Hz (100 V 
±10% or 220 V ±10%). The total weight is 71.3 kg. 

5.6. Evaluation and Discussion 

Figure 19 shows a block diagram of the measurement 
setup of the frequency response of the FFX Correlator. 
To investigate the frequency response, it is useful to use 
the CW signal, which can measure the folding effects by 
sweeping the frequency range of to 4096 MHz. The white 
noise is important to measure the frequency response, be- 
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Fig. 19. Block diagram of the measurement setup. A de- 
tailed block diagram of the FFX correlator is shown in figure 
3. The white noise is generated by the ASTE analog backend 
subsystem, while the CW signal is produced by the Agilent 
E4440B, which is locked in phase to a standard signal of 10 
MHz generated by the reference clock generator. The ADC 
1 and ADC 2 with 8192 Msps are also locked in phase to 
a signal of 8192 MHz generated by the reference clock gen- 
erator. Note that the ADCs were developed specifically for 
ASTE (Okuda et al. 2008), and all output signals from the 
reference clock generator are generated by the built-in 5MHz 
crystal oscillator (OCXO). 

cause the astronomical signals obey the Gaussian random 
variable. To obtain the input signals that are approxi- 
mated to the zero-mean Gaussian probability, mixing of 
the CW signal with the white noise is necessary. 

The frequency response of the FFX correlator when CW 
is included is written as 



Po„(/)=aon-[|C(/)| 2 + |iV(/)| 2 ] 



(25) 



where C(/) is the frequency response of the CW signal, 
N(f) is the frequency response of the white noise from the 
ASTE analog backend subsystem, Hi and H 2 are the fre- 
quency response by different transmission paths, in which 
Hi = H t iH a i and H 2 = H t2 H a2 (see figure 19), and i? D 
is the frequency response of the FFX correlator, including 
the effects of requantization and the folding noise after 
downsampling. The bandpass calibration is essential for 
estimating the CW power accurately, because the band- 
pass response becomes a time-variable due to outdoor air 
temperature. The frequency response without the CW 
signal is written as 

Poff (/) = floff ■ |^V(/)| 2 ■ Hi{f)H* 2 {f) ■ \H D (f)\ 2 . (26) 

The ADCs work as 1-bit performance (Okuda et al. 2008). 
In that case, it is important to adjust the power as pre- 
cisely as possible so as to avoid the high-order spurious 
effects. The frequency responses of P on {f) and -P ff(/) 
depend on the relative power of the CW signal to the 
white noise, and also the threshold levels in quantization. 
To correct these effects, we need to calibrate the bandpass 
by sensitively adjusting the continuum floor level of P s 
to that of P on in the data analysis. These values are a on 
and a ff- 

From equations (25) and (26), the frequency response 
in a FFX mode is written as 

Pjn(f) 

= a oa ■ [|C(/)| 2 + |iV(/)| 2 l ■ Hi(f)H* 2 (f) ■ |^(/)| 2 ,(27) 



while the frequency response without the CW signal can 
be written as 



F D F ff (/) = a oS ■ \N(f)\ 2 • ffi (/)#!(/) ■ 



(28) 



and then the CW frequency response including the re- 
sponse of the measurement system is derived as 



a Q ff 



a on -\C(f)\ 2 -Hi(f)H* 2 (f)-\Hl(f)\ 



(29) 



Similarly, the frequency response in a FX mode is written 
as 

Pon(f) 

= a on ■ [|C(/)| 2 + \N(f)\ 2 ] ■ Hi{f)H* 2 {f) ■ K(/)| 2 (30) 

while the frequency response without the CW signal can 
be written as 

PlU) = a oS ■ \N{f)\ 2 -Hi(f)H* 2 (f) ■ \H$(f)\ 2 , (31) 

and then the CW frequency response including the re- 
sponse of the measurement system is derived as 



Ooff 



a on -\C{f)\ 2 -Hi{f)H* 2 {f)-\H^{f)\ 



(32) 



From equations (29) and (32), we can derive the fre- 
quency response in a FFX mode from the correlated spec- 
tra obtained in FFX and FX modes as 



\HE(f)\ 2 = 



P F (f) 

pN (/) 



\H£(f)\ 



(33) 



Since the frequency response of H^(f) in the FX mode 
is well known (Thompson et al. 2001), the frequency 
response in a FFX mode can be derived finally. 

Figure 20 shows the frequency response of the FFX cor- 
relator at a bandwidth of 64 MHz in the range of 2048 to 
2112 MHz, and that the lower limits of about -33 dB to 
-40 dB are successfully measured with this method. The 
measurement frequency resolution was 31.25 kHz. The 
measurement results show that the effective bandwidth is 
about 59.28 MHz, which was obtained by the passband 
responses of about -1 dB at 2046.40625 and 2106.28125 
MHz, -3 dB at 2045.3125 and 2107.28125 MHz, and by 
a stopband response with the first sidelobc of about —20 
dB. The measurement results are well consistent with the 
theoretical curve in the passband, both bandedges (sharp- 
ness), and first sidelobe levels. In the stopband response 
except these responses, however, it is shown that there 
are differences between theoretical curve and measured 
results. This problem is probably due to the non-linear 
response of 1-bit ADC and the precision of the data re- 
duction process in this measurement method. The cross- 
modulation distortion is strongly generated in digitizing 
the CW signals at 1 bit. This character complicates the 
data reduction method, and will reduce the measurement 
precision. If the ADCs with 3 bits or more are feasible, 
this problem will be relaxed. 
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Fig. 20. The measured frequency response of the FFX correlator at a bandwidth of 64 MHz in a frequency rage of 2048 to 2112 
MHz. First sidelobe of about —20 dB and other stopband response of better than —23 dB were successfully achieved. A comparison 
between the measured frequency response of the FFX correlator (points) and the ideal frequency response of the FFX scheme (full 
line) in a rectangular window function without zeroing [see equations (14) and (11)] are shown. 



Finally, the measurement results show that the theory 
of the FFX scheme can be confirmed, and the development 
of the FFX Correlator was successfully realized. 

6. Summary 

There are two basic designs of a digital correlator: the 
XF-typc in which the cross-correlation is calculated before 
Fourier transformation, and the FX-typc in which Fourier 
transformation is performed before cross multiplication. 
To improve the FX-type correlator, we established a new 
algorithm for correlation process, that is called the FFX 
scheme. The FFX scheme demonstrates that the realiza- 
tion of a stopband response with first and second sidelobcs 
of —34 dB and higher-order sidelobes of —60 dB is techni- 
cally feasible. The FFX scheme consists of 2-stage Fourier 
Transform blocks, which perform the lst-stage Fourier 
Transform as a digital filter, and the 2nd-stage Fourier 
Transform to achieve higher dispersion. The FFX scheme 
provides flexibility in the setting of bandwidth within the 
sampling frequency. 

The input data rate of the developed FFX correlator is 
about 48 Giga bit per second (Gbps) with 3-bit quanti- 
zation at the sampling frequency of 8192 or 4096 Msps, 
which is 8192 Msps x 3 bits x 2 IF or 4096 Msps x 3 bits 
x 4 IFs. We have successfully evaluated the feasibilities of 
the FFX correlator hardware. Also, this hardware will be 



installed and operated as a new spectrometer for ASTE. 

We successfully developed the FFX correlator, mea- 
sured its performances, and demonstrated the capability 
of a wide-frequency coverage and high-frequency resolu- 
tion of the correlation systems. Our development and 
measurement results will also be useful and helpful in de- 
signing and developing the next generation correlator. 
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